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Abstract. Robust search procedures are a central component in the de- 
sign of black-box constraint-programming solvers. This paper proposes 
activity-based search, the idea of using the activity of variables during 
propagation to guide the search. Activity-based search was compared 
experimentally to impact-based search and the wdeg heuristics. Ex- 
perimental results on a variety of benchmarks show that activity-based 
search is more robust than other heuristics and may produce significant 
improvements in performance. 

1 Introduction 

Historically, the constraint-programming (CP) community has focused on devel- 
oping open, extensible optimization tools, where the modeling and the search 
procedure can be specialized to the problem at hand. This focus stems partially 
from the roots of CP in programming languages and partly from the rich mod- 
eling language typically found in CP systems. While this flexibility is appealing 
for experts in the field, it places significant burden on practitioners, reducing its 
acceptance across the wide spectrum of potential users. In recent years however, 
the constraint-programming community devoted increasing attention to the de- 
velopment of black-box constraint solvers. This new focus was motivated by the 
success of Mixed-Integer Programming (MIP) and SAT solvers on a variety of 
problem classes. MIP and SAT solvers are typically black-box systems with au- 
tomatic model reformulations and general-purpose search procedures. As such, 
they allow practitioners to focus on modeling aspects and may reduce the time 
to solution significantly. 

This research is concerned with one important aspect of black-box solvers: 
the implementation of a robust search procedure. In recent years, various pro- 
posals have addressed this issue. Impact-based search (IBS) [10] is motivated by 
concepts found in MIP solvers such as strong branching and pseudo costs. Sub- 
sequent work about solution counting can be seen as an alternative to impacts 
[8] that exploits the structure of CP constraints. The weighted-degree heuristic 
(wdeg) [1] is a direct adaptation of the SAT heuristic VsiDs[5] to CSPs that 
relies on information collected from faihircs to define the variable ordering. 

This paper proposes Activity-Based Search (Abs), a search heuristic that 
recognizes the central role of constraint propagation in constraint-programming 



systems. Its key idea is to associate with each variable a counter which measures 
the activity of a variable during propagation. This measure is updated system- 
atically during search and initialized by a probing process. Abs has a number of 
advantages compared to earlier proposals. First, it does not deal explicitly with 
variable domains which complicates the implementation and runtime require- 
ments of IBS. Second, it does not instrument constraints which is a significant 
burden in solution-counting heuristics. Third, it naturally deals with global con- 
straints, which is not the case of WDEG since all variables in a failed constraint 
receive the same weight contribution although only a subset of them is relevant 
to the conflict. Abs was compared experimentally to Ibs and wdeg on a variety 
of benchmarks. The results show that Abs is the most robust heuristic and can 
produce significant improvements in performance over Ibs and WDEG, especially 
when the problem complexity increases. 

The rest of the paper is organized as follows. Sections 2 and 3 review the IBS 
and WDEG heuristics. Section 4 presents Abs. Section 5 presents the experimental 
results and Section 6 concludes the paper. 



2 Impact-Based Seairch 

Impact-based search was motivated by the concept of pseudo-cost in MIP solvers. 

The idea is to associate with a branching decision x = v a measure of how 
effectively it shrinks the search space. This measure is called the impact of the 
branching decision. 

Formalization Let P = {X, D, C) be a CSP defined over variables X, domains 
D, and constraints C. Let D{xi) denote the domain of variable Xi & X and 
|£>(a;8)| denote the size of this domain. A trivial upper-bound on the size of the 
search space of S{P) is given by the product of the domain sizes: 

s{p) = n 1^(^)1 

At node fc, the search procedure receives a CSP Pk-i = {X,Dk_i,Ck-i), where 
Cfe_i = CU {co, ci, C2, • • • , Cfe_i} and Ci is the constraint posted at node i. 
Labeling a variable x with value a G Dk-i{x) adds a constraint a; = a to Ck-i 
to produce, after propagation, the CSP Pk = {X,Dk,Ck)- 

The contraction of the search space induced by a labeling a; = a is defined as 

I(x = a) = 1 



SiPk-i) 

I{x = a) = 1 when the assignment produces a failure since S{Pk) = and 
I{x = a) « whenever S{Pk) « <S(Pfe_i), i.e., whenever there is almost no 
domain reduction. An estimate of the impact of the labeling constraint x = a 
over a set of search tree nodes fC can then be defined as 

V 1 _ 



Actual implementations (e.g., [7]) rely instead on 



Io{x = a) ■ {a- 1) + I{x = a) 

Ii (x = a) = — ^ — ^ ^ 

a 

where a is a parameter of the engine and the subscripts in Iq and Ji denote the 
impact before and after the update. Clearly, a = 1 yields a forgetful strategy, 
a = 2 gives a running average that progressively decays past impacts, while a 
choice a > 2 favors past information over most recent observations. 
The (approximate) impact of a variable x at node k is defined as 

I{x) = Yl ^-I{x = a) 

To obtain suitable estimates of the assignment and variable impacts at the root 
node, IBS simulates all the J2xex 1-^(^)1 possible assignments. For large domains, 
domain values are partitioned in blocks. Namely, for a variable x, let D{x) = 
U^^iBi with Bi O Bj = (d Vj,j : i j E l..b. The impact of a value a E Bi 
{i € 1..6) is then set to I{x = a) = I{x G Bi). With partitioning, the initialization 
costs drop from |£'(x)| propagations to b propagations (one per block). The 
space requirement for IBS is Oi^^^^ l-^(^)l)> since it stores the impacts of all 
variable/ value pairs. 

The Search Procedure IBS defines a variable and a value selection heuristic. IBS 
first selects a variable x with the largest impact, i.e., x S argMax^£j(-I(a;). It then 
selects a value a with the least impact, i.e., a G argMin^g£,(^)7(a; = v). Neither 
argMax^g^Z(a:;) nor argMin^g£,(j.-,/(a; = v) are guaranteed to be a singleton and, 
in case of ties, Ibs breaks the ties uniformly at random. 

As any randomized search procedure, IBS can be augmented with a restart 
strategy. A simple restarting scheme limits the number of failures in round i to 
li and increases the limit between rounds to Zj+i = p - k where p> 1. 

3 The WDEG Heuristic 

WDEG maintains, for each constraint, a counter (weight) representing the num- 
ber of times a variable appears in a failed constraint, i.e., a constraint whose 
propagation removes all values in the domain of a variable. The weighted degree 
of variable x is defined as 

C(wdeg{x) = weigktlc] s.t. X e vars{c)x A \FutVars{c)\ > 1 

where FutVars{c) is the set of uninstantiated variables in c. 

WDEG only defines a variable selection heuristic: It first selects a variable x 
with the smallest ratio . AH the weights are initialized to 1 and, when a 

constraint fails, its weight is incremented. The space overhead of WDEG is ©(jCI) 
for aCSP {X,D,C). 



4 Activity-Based Search 



Abs is motivated by the key role of propagation in constraint-programming 
solvers. Contrary to SAT solvers, CP uses sophisticated filtering algorithms to 
prune the search space by removing values that cannot appear in solutions. Abs 
exploits this filtering information and maintains, for each variable x, a measure 
of how often the domain of x is reduced during the search. The space requirement 
for this statistic is ^dXj). Abs can optionally maintain a measure of how much 
activity can be imputed to each assignments a; = a in order to drive a value- 
selection heuristic. If such a measure is maintained, the space requirement is 
proportional to the mimber of distinct assignments performed during the search 
and is bounded by 0{^^^^ \D{x)\). Abs relies on a decaying sum to forget the 
oldest statistics progressively, using an idea from VsiDS. It also initializes the 
activity of the variables by probing the search space. 

Abs is simple to implement and does not require sophisticated constraint 
instrumentation. It scales to large domains without special treatment and is 
independent of the domain sizes when the value heuristic is not used. Also, Abs 
does not favor variables appearing in failed constraints, since a failure in a CP 
system is typically the consequence of many filtering algorithms. 

Formalization Given a CSP P = {X,D,C), a CP solver applies a constraint- 
propagation algorithm F after a labeling decision. F produces a new domain 

store D' C D enforcing the required level of consistency. Applying F to P 
identifies a subset X' C X oi affected variables defined by 

WxeX' : D'{x) C D{x); 
\/x(zX\X' : D'{x) = D{x). 

The activity of x, denoted by A{x), is updated at each node k of the search tree 
by the following two rules: 

Vx e X s.t. [^(a;)! > 1 : A{x) = A{x) ■ 7 
Va; e X' : a'{x) = A{x) + 1 

where X' is the subset of affected variables and 7 be an age decay parameter 
satisfying < 7 < 1. The aging only affects free variables since otherwise it 
would quickly erase the activity of variables labeled early in the search. 

The activity of an assignment a; = a at a search node k is defined as the 
number of affected variables in \X'\ when applying F on C U {x = a}, i.e., 

Ak{x = a) = \X'\. 

As for impacts, the activity of a; = a over the entire tree can be estimated by 
an average over all the tree nodes seen so far, i.e., over the set of nodes K. The 
estimation is thus defined as 



Once again, it is simpler to favor a weighted sum instead 



J[ /j, = a) = Mx = a)-{a-'^) + Mx = a) 

a 

where the subscripts on A capture the estimate before and after the update. 
When the value heuristic is not used, it is not necessary to maintain A{x = a) 
which reduces the space requirements without affecting variable activities. 

The Search Procedure Abs defines a variable ordering and possibly a value or- 
dering. It selects the variable x with the largest ratio j^|^7 i-e., the most ac- 
tive variable per domain value. Ties are broken uniformly at random. When 
a value heuristic is used, Abs selects a value a with the least activity, i.e., 
a G argMin^g£)(j.-) A(.x = v). The search procedure can be augmented with 
restarts. The activities can be used "as-is" to guide the search after a restart. It 
is also possible to reinitialize activities in various ways, but this option was not 
explored so far in the experimentations. 

Initializing Activities Abs uses probing to initialize the activities. Consider a 
path TT going from the root to a leaf node fc in a search tree for the CSP P = 
{X,D,C). This path n corresponds to a sequence of labeling decisions {xq = 
vo,xi = vi, - ■ ■ ,Xk = Vk) in which the j*^ decision labels variable xj with vj G 
Dj{xj). If Xj C X is the subset of variables whose domains are filtered as a 
result of applying F after decision Xj = Vj , the activity of variable x along path 
TT is defined as A^{x) = A^{x) where 

AUx) = 

A]{x) = A]_^{x) + l-^x e {l<j <k) 
A]{x) = A]_-^{x) ^x^Xj{l<j< k) 

A^ {x) = if X was never involved in any propagation along tt and A^{x) = fc if 
the domain of x was filtered by each labeling decision in tt. Also, A^(x) = A{x) 
when 7=1 (no aging) and path tt is followed. 

Now let us now denote 77 the set of all paths in some search tree of P. Each 
such path n £ n defines an activity A'^{x) for each variable x. Ideally, we would 
want to initialize the activities of x as the average over all paths in 77, i.e., 

M.) = ^^-^. 

Abs initializes the variables activities by sampling 77 to obtain an estimate of 
the mean activity iIa{x) from a sample 77 C 77. More precisely, Abs repeatedly 
draws paths from 77. These paths are called probes and the j*'* assignment 
Xj = Vj in a probe p is selected uniformly at random as follows: (1) Xj is a free 
variable and (2) value Vj is picked from Dj{xj). During the probe execution, 
variable activities are updated normally but no aging is applied in order to ensure 
that all probes contribute equally to fiAix). Observe that some probes may 



terminate prematurely since a failure may be encountered; others may actually 
find a solution if they reach a leaf node. Moreover, if if a failure is discovered at 
the root node, singleton arc-consistency [9] has been established and the value 
is removed from the domain permanently. 

The number of probes is chosen to provide a good estimate of the mean 
activity over the paths. The probing process delivers an empirical distribution 
A{x) of the activity of each variable x with mean ^a{x) and standard deviation 
(74 (.x). Since the probes are iid, the distribution can be approximated by a normal 
distribution and the probing process is terminated when the 95% confidence 
interval of the t-distribution, i.e., when 



is sufficiently small (e.g., within 5% of the empirical mean) for all variables x 
with n being the number of probes, 

Observe that this process does not require a separate instrumentation. It uses 
the traditional activity machinery with 7 = 1. In addition, the probing process 
does not add any space requirement: the sample mean ^a{x) and the sample 
standard deviation (Ja{x) are computed incrementally, including the activity 
vector for each probe as it is completed. If a value heuristic is used the 
sampling process also maintains A{x = a) for every labeling decision x = a 
attempted during the probes. 

5 Experimental Results 
5.1 The Experimental Setting 

The Configurations All the experiments were done on a Macbook Pro with 
a core i7 at 2.66Ghz running MacOS 10.6.7. Ibs, wdeg, and Abs were were 
all implemented in the Comet system [3]. Since the search algorithms are in 
general randomized, the empirical results arc based on 50 runs and the tables 
report the average (/xt) and the standard deviation ctt of the running times 
in seconds. Unless specified otherwise, a timeout of 5 minutes was used and 
runs that timeout were assigned a 300s runtime. The following parameter values 
were used for the experimental results: a = 8 in both IBS and Abs, 7 = 0.999 
(slow aging), and 5 = 20%. Experimental results on the sensitivities of these 
parameters will also be reported. For every heuristic, the results arc given for 
three strategies: no restarts {NR), fast restarting (p = 1.1) or slow restarting 
(p = 2). depending on which strategy is best across the board. The initial failure 
limit is set to 3 • \X\. 

Search Algorithms The search algorithms were run on the exact same models, 
with a single line changed to select the search procedure. In our experiments, 
Ibs does not partition the domains when initializing the impacts and always 
computes the impacts exactly. Both the variable and value heuristics break ties 



[iIa{x) - to.05,n-l ■ 



cr~A{x) 



l^Aix) + io.05,n-l • 



(ja{x)_ 




randomly. In wdeg, no value heuristic is used: The values are tried in the se- 
quential order of the domain. Ties in the variable selection are broken randomly. 
All the instances are solved using the same parameter values as explained ear- 
lier. No comparison with model-counting heuristic is provided, since these are 
not available in publicly available CP solvers. 

Benchmarks The experimental evaluation uses benchmarks that have been widely 
studied, often by different communities. The multi-knapsack and magic square 
problems both come from the Ibs paper [10]. The progressive party has been 
a standard benchmark in the local search, mathematical-programming, and 
constraint-programming communities, and captures a complex, multi-period al- 
location problem. The nurse rostering problem [11] originated from a mathematical- 
programming paper and constraint programming was shown to be a highly effec- 
tive and scalable approach. The radiation problem is taken from the 2008 MiniZ- 
inc challenge [6] and has also been heavily studied. These benchmarks typically 
exploit many features of constraint-programming systems including numerical, 
logical, reified, element, and global constraints. 

5.2 The Core Results 

Multi-Knapsack This benchmark is from [10] and our implementation follows Re- 
falo's informal description. The satisfaction model uses an arithmetic encoding 
of the binary knapsacks (not a global constraint) where the objective is replaced 
by a linear equality with a right-hand-side set to the known optimal value. All 
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Table 1. Experimental Results on Multi-Knapsack. 



the constraints use traditional bound-consistency algorithms for filtering lin- 
ear constraints. A second set of experiments considers the optimization variant. 
The COP uses n global binary knapsack constraints (binaryKnapsackAtmost in 
Comet) based on the filtering algorithm in [13]. These benchmarks contain up 
to 50 variables. 

Figure 1 is a pictorial depiction of the behavior of the three search algorithms 
with no restarts. The chart on the left gives the absolute running time (in sec- 
onds) with a logarithmic scale. The stacked bar chart on the right reports the 
same data using a relative scale instead where the height of the bar is the nor- 
malized sum of the running time of all three algorithms and the length of each 
segment is its normalized running time. Note that, since this view is not logarith- 
mic, adjacent bars correspond to different totals. The results clearly show that, 
as the difficulty of the problem increases, the quality of wdeg sharply decreases 
and the quality of Abs significantly improves. On the harder instances, Abs is 
clearly superior to IBS and vastly outperforms WDEG. Figure 2 conveys the same 
information for the optimization variant with similar observations. 

Table 1 gives the numerical results for instances 1 — 2 to 1 — 6. The first 

column specifics the instance, while the remaining columns report the average 
run times, the standard deviations, and the number of runs that did not time-out. 
The results are given for no-restart and slow-restart strategies for all heuristics. 
On the decision instance 1 — 6, wdeg often fails to find a solution within the 
time limit and, in general, takes considerable time. Abs always finds solutions 
and is about 5 times faster than les for the no-restart strategy which is most 
effective on the decision variant. On the optimization variant, WDEG cannot solve 
instance 1 — 5 in any of the 50 runs and Ibs does not always find a solution. 
Abs, in contrast, finds a solution in all 50 runs well within the time limit. 

In summary, on this benchmark, WDEG is vastly outperformed by IBS and 

Abs as soon as the instances are not easy. Abs is clearly the most robust heuris- 
tic and produces significant improvements in performance on the most difficult 
instances, both in the decision and optimization variants. 
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Table 2. Experimental Results on Magic Squares. 



Magic Square This benchmark is also from [10] and the model is based on a 
direct algebraic encoding with 2 • n linear equations for the rows and columns 
(the square side is n), 2 linear equations for the diagonals, one alldif f erent 
constraint (not enforcing domain consistency) for the entire square, 2 • n binary 
inequalities to order the elements in the diagonals, and two binary inequalities 
to order the top-left corner against the bottom-left and top-right corners. Table 
2 report results for squares of size 7 to size 10. The F column in Table 2 reports 
the number of successful runs (no timeout). 

On magic squares, WDEG is completely dominated by Ibs and Abs: It has 
poor performance and is not robust even on the simpler instances. The best 
performance for IBS and Abs is obtained using a fast restart, in which case Abs 
provides a slight improvement over IBS. IBS is more effective than Abs with slow 
or no restarts. 

Progressive Party The progressive party problem [12] is a constraint satisfaction 
problem featuring a mix of global constraint and has been used frequently for 
benchmarking CP, LS, and MIP solvers. The instance considered here is the 2 — 8 
instance with 29 guests, 8 periods and 13 hosts, i.e., 232 variables with domains 
of size 13. The goal is to find a schedule for a social event taking place over k 
time periods subject to constraints on the sizes of the venues (the boats), sizes 
of the group, and social constraints (two groups cannot meet more than once 
and one group cannot go back to the same boat more than once). The model 
relies on multiple global alldifferent, multi-knapsacks and arithmetic constraints 
with reifications. This model breaks the search in k phases (one per period) and 
uses the black-box heuristic within each period. 

The results are given in Table 3. Abs clearly dominates IBS on this benchmark 
for all restarting strategies. Abs is also clearly superior to WDEG when no restarts 
are used but is slightly slower than wdeg when slow or fast restarts are used. 
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Table 3. Experimental Results on the Progressive Party 2 — 8. 
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267.58 


0.02 


0.01 


50 




wdeg|R(2) 


4812533.43 


300.00 


0.00 


2 




IBS 1 NR. 


1113.26 


0.07 


0.01 


50 


z4 


Abs 1 NR. 


30221.04 


1.41 


0.09 


50 




IBS R(l.l) 


1028.38 


0.08 


0.01 


50 




Abs|R(1.1) 


257205.36 


11.60 


0.21 


50 




IBS R(2) 


820.52 


0.07 


0.01 


50 




Abs R(2) 


54855.60 


2.53 


0.08 


50 




wdeg|NR 


45043.22 


1.77 


0.08 


50 




IBS|NR 


2794820.62 


118.25 


38.18 


50 




wdeg|R(1.1) 


63783.44 


2.46 


0.17 


50 




IBS R(l.l) 


8241949.70 


300.00 


0.00 


2 




wdeg|R(2) 


47162.36 


1.87 


0.08 


50 




IBS R(2) 


5817294.96 


221.15 


42.75 


47 


z2 


Abs 1 NR. 


45223.02 


2.42 


0.65 


50 




wdeg|NR 


6386541.00 


300.00 


0.00 







Abs R(l.l) 


372174.98 


19.49 


9.03 


50 




wdeg|R(1.1) 


5707406.00 


300.00 


0.00 







Abs R(2) 


98057.72 


5.03 


2.53 


50 




wdeg|R(2) 


5000897.00 


300.00 


0.00 







IBS 1 NR. 


82182.32 


3.84 


0.91 


50 


z5 


Abs 1 NR. 


344187.52 


17.89 


3.91 


50 




IBS R(l.l) 


656035.56 


24.86 


7.60 


50 




Abs|R(1.1) 


3899344.36 


185.81 


38.09 


50 




IBS R(2) 


177432.42 


6.78 


1.96 


50 




Abs R(2) 


902142.38 


43.40 


12.82 


50 




wdeg|NR 


6361685.84 


300.00 


0.00 


1 




IBS 1 NR. 


120571.44 


6.16 


3.61 


50 




wdeg|R(1.1) 


5372380.94 


300.00 


0.00 


3 




IBS R(l.l) 


468481.40 


24.03 


18.69 


50 




wdeg|R(2) 


4944998.26 


300.00 


0.00 


1 




IBS R(2) 


192116.32 


9.84 


5.27 


50 


z3 


Abs|NR 


326902.20 


23.32 


10.88 


50 


z8 


Abs|NR 


59314.68 


3.52 


0.18 


50 




Abs|R(1.1) 


1944533.10 


139.55 


81.15 


50 




Abs|R(1.1) 


599777.70 


36.04 


3.70 


50 




Abs R(2) 


488344.88 


35.26 


25.40 


50 




Abs R(2) 


119224.04 


7.00 


0.53 


50 




IBS|NR 


214032.16 


14.96 


4.45 


50 




IBS|NR 


7204787.00 


273.18 


40.34 


8 




IBS R(l.l) 


893297.88 


62.27 


12.23 


50 




IBS R(l.l) 


8201719.08 


300.00 


0.00 







IBS R(2) 


287935.30 


19.62 


7.01 


50 




IBS R(2) 


4301537.08 


161.61 


54.71 


46 



Table 4. Experimental Results on Nurse Rostering. 



Nurse Rostering This benchmark is taken from [11] and is a rostering prob- 
lem assigning nurses to infants in an hospital ward, while balancing the work- 
load. The multi-zone model can be found in Listing 1.2 in [11]. The custom 
search procedure is removed and replaced by a call to one of the generic searches 
(Ibs,Abs,wdeg). Table 4 reports the performance results for the three heuris- 
tics and 3 restarting strategies on the one-zone instances (zl-z5,z8). Note that 
the custom procedure in [11] relies on a dynamic-symmetry breaking on values 
and sophisticated variable/ value ordering. Results for wdeg beyond z5 are not 
reported as it times out systematically. As before, column F reports the number 
of runs that finish (out of 50), C reports the number of choice points and the T 
columns reports the mean and standard deviation of the running time. 

WDEG exhibits extremely poor performance and robustness on this bench- 
mark. Abs is clearly the most robust procedure as it solves all instances in all its 
runs for all the restarting strategies. It is also significantly faster than IBS on z4 
and z8. Ibs behavcis wc;ll in general, except on z4 and z8 for which it sometimes 
fails to find solutions and takes significantly more time than Abs. It is faster 
than Abs on z3 and z5. 



)^a;x \D{x) 



1x144 + 10x37 + 296x37 
1x178 + 10x37 + 333x37 
1x149 + 10x37 + 333x37 
1x175 + 10x37 + 333x37 
1x233 + 10x50 + 441x50 



Table 5. Description of the Radiation Instances. 



B 


Model 


p(C) 


KT) 


o-(T) 


F 




B 


Model 


m(c) 


M(T) 


<T(T) 


F 


6 


Abs|NR 


14934.94 


1.99 


0.65 


50 




8 


IBS|R(2) 


88117.48 


9.36 


1.34 


50 




Abs|R(1.1) 


10653.36 


1.49 


0.39 


50 






wdeg|NR 


38591.42 


2.90 


0.58 


50 




Abs R(2) 


10768.98 


1.50 


0.44 


50 






wdeg|R(1.1) 


20396.80 


1.72 


0.39 


50 




lBS|NR 


65418.78 


6.89 


0.72 


50 






wdeg|R(2) 


6907.14 


0.55 


0.12 


50 




IBS R(l.l) 


86200.18 


8.60 


0.98 


50 




9 


Abs 1 NR. 


40339.62 


5.79 


3.36 


50 




IBS R(2) 


67003.40 


7.07 


0.70 


50 






Abs|R(1.1) 


20599.88 


3.21 


0.35 


50 




wdeg|NR 


23279.70 


1.77 


0.41 


50 






Abs R(2) 


14101.00 


2.28 


0.51 


50 




wdeg|R(1.1) 


3798.00 


0.30 


0.12 


50 






IBS 1 NR. 


85205.62 


9.70 


0.61 


50 




wdeg|R(2) 


2918.68 


0.23 


0.08 


50 






IBS R(l.l) 


141311.76 


14.40 


3.03 


50 


7 


Abs|NR 


17434.30 


2.73 


1.84 


50 






IBS R(2) 


92431.06 


10.34 


0.60 


50 




Abs|R(1.1) 


8481.62 


1.53 


0.35 


50 






wdeg|NR 


90489.62 


7.33 


1.35 


50 




Abs R(2) 


9229.80 


1.62 


0.51 


50 






wdeg|R(1.1) 


48641.80 


4.49 


1.73 


50 




IBS|NR 


90055.32 


10.42 


0.44 


50 






wdeg|R(2) 


12806.06 


1.20 


0.58 


50 




IBS R(l.l) 


161022.24 


15.93 


6.43 


50 




10 


Abs 1 NR. 


210181.18 


34.56 


17.00 


50 




IBS R(2) 


98742.94 


11.13 


1.73 


50 






Abs|R(1.1) 


102777.38 


17.19 


3.53 


50 




wdeg|NR 


7868.16 


0.65 


0.24 


50 






Abs R(2) 


50346.82 


9.10 


1.65 


50 




wdeg|R(1.1) 


2762.26 


0.24 


0.10 


50 






IBS 1 NR. 


2551543.8 


300.01 


0.00 







wdeg|R(2) 


2824.00 


0.24 


0.12 


50 






IBS R(l.l) 


2504564.1 


300.01 


0.00 





8 


Abs|NR 


33916.58 


4.31 


1.04 


50 






IBS R(2) 


2525199.8 


300.01 


0.00 







Abs R,(l.l) 


48638.90 


6.01 


0.89 


50 






wdeg|NR 


629073.46 


60.09 


39.47 


49 




Abs R(2) 


18294.96 


2.46 


0.52 


50 






wdeg|R(1.1) 


232572.16 


27.88 


2.28 


50 




IBS|NR 


84329.16 


8.98 


1.08 


50 






wdeg|R(2) 


47175.04 


5.60 


1.30 


50 




IBS R(l.l) 


187346.80 


16.94 


4.97 


50 















Table 6. Experimental Results on Radiation Benchmarks. 



Radiation This last benchmark is a constrained optimization problem for radia- 
tion therapy taken from the 2008 MiniZinc challenge [6]. The objective is to find 
a setup of a radiation therapy machine to deliver a desired radiation intensity 
to a tumor. The problem uses algebraic constraint and the formulation can be 
found in the mini-zinc repository [4]^. The search procedure must deal with all 
the variables at once. In 2008, several solvers were unable to solve most instances 
in a reasonable amount of time as seen in [4] , which indicates the difficulty of 
the instances. The instance sizes are specified in Table 5. A row gives a term for 
each array in the problem with its size and the size of the domains. For instance, 
instance 9 has one variable with domain size 175, ten variables with domain size 
37, and 333 variables with domain sizes 37. 

Table 6 reports the results for 5 instances. Abs clearly dominates Ibs on 
all instances and IBS cannot solve the largest instance within the time limit for 
any restarting strategy. WDEG performs well in general on this benchmark. It is 
slightly faster than Abs on the largest instance for the slow and fast restarts. 



^ In this model, the time that the beam is on is a variable and must be optimized 
alongside the number of patterns. 



B 


Method 


m(c) 




a{T) 


S 




B 


Method 


p(C) 


KT) 




S 


6 


Abs|NR 


11224.80 


1.48 


0.58 


50 




9 


Abs 1 NR. 


36534.92 


5.06 


1.18 


50 




Abs|R(1.1) 


18803.18 


2.30 


0.86 


50 






Abs|R(1.1) 


46948.84 


6.76 


1.99 


50 




Abs|R(2) 


12248.46 


1.57 


0.43 


50 






Abs R(2) 


23600.68 


3.46 


1.02 


50 


7 


Abs|NR 


7147.90 


1.27 


0.39 


50 




10 


Abs 1 NR. 


213094.82 


33.70 


9.23 


50 




Abs|R(1.1) 


12161.34 


1.92 


0.68 


50 






Abs R(l.l) 


239145.34 


40.75 


7.55 


50 




Abs|R(2) 


10926.12 


1.74 


0.54 


50 






Abs R(2) 


87626.36 


14.87 


4.14 


50 


8 


Abs|NR 


27702.00 


3.53 


0.78 


50 
















Abs R(1.1) 


63755.24 


7.80 


2.27 


50 
















Abs R(2) 


16708.46 


2.23 


0.47 


50 















Table 7. The Influence of the Value Ordering. 



but is slower with no restarts. On instance 9, it is faster with no restart and 
slower when slow or fast restart are used. Both wdeg and Abs are effective on 
this problem and clearly superior to IBS. 

Summary On this collection of benchmarks, Abs is clearly the most robust 
and effective heuristic. It is robust across all benchmarks and restarting strate- 
gies and is, in general, the fastest heuristic, wdeg has significant robustness 
and performance issues on the multi-knapsack, magic square, and nurse roster- 
ing benchmarks. IBS has some robustness issues on radiation, some rostering 
instances, and the optimization variant of the large knapsack problems. It is 
in general significantly less efficient than Abs on the knapsack, rostering, and 
radiation benchmarks. 

5.3 Sensitivity Analysis 

Criticality of the Variable Ordering Tabic 7 reports the performance of activity- 
based search when no value ordering is used. The value heuristic simply tries 
the value in increasing order as in WDEG. The results indicate that the value 
selection heuristic of Abs docs not play a critical role and is only marginally 
faster/more robust on the largest instances. 

Sensitivity to the Sample Size Figure 3 illustrates graphically the sensitivy of 
Abs to the confidence interval parameter 6 used to control the number of probes 
in the initialization process. The statistics are based on 50 runs of the non- 
restarting strategy. The boxplots show the four main quartilcs for the running 
time (in seconds) of Abs with 6 ranging from 0.8 down to 0.05. The blue line 
connects the medians whereas the red line connects the means. The circles be- 
yond the extreme quartiles are outliers. The left boxplot shows results on msq-10 
while the right one shows results on the optimization version of knapl-4. 

The results show that, as the number of probes increases (i.e., 5 becomes 
smaller) , the robustness of the search heuristic improves and the median and the 
mean tend to converge. This is especially true on knapl-4, while msq-10 still 
exhibits some variance when S = 0.05. Also, the mean decreases with more probes 
on msq-10, while the mean increases on knapl-4 as the probe time becomes 
more important. The value 6 = 0.2 used in the core experiments seem to be a 
reasonable compromise. 



CI 



CI 



Fig. 3. Sensitivity to the Sample Size as Specified by S. 




Fig. 4. Sensitivity to the Aging Process as Specified by 7. 



Sensitivity to 7 (Aging) Figure 4 illustrates the sensitivity to the aging parame- 
ter. The two boxplots are once again showing the distribution of running times 
in seconds for 50 runs of msq-10 (left) and knapl-4 (right). What is not imme- 
diately visible on the figure is that the number of timeouts for msq-10 increases 
from for 7 = 0.999 to 9 for 7 = 0.5. Overall, the results seem to indicate that 
the slow aging process is desirable. 



5.4 Some Behavioral Observations 



Figure 5 depicts the behavior of Abs and Ibs on radiation #9 under all three 
restarting strategies. The x axis is the running time in a logarithmic scale and 
the y axis is the objective value each time a new upper bound is found. The three 
red curves depict the performance of activity-based search, while the three blue 
curves correspond to impact-based search. What is striking here is the difference 
in behavior between the two searches. Abs quickly dives to the optimal solution 
and spends the remaining time proving optimality. Without restarts, activity- 
based search hits the optimum within 3 seconds. With restarts, it finds the 
optimal solution within one second and the proof of optimality is faster too. IBS 
slowly reaches the optimal solution but then proves optimality quickly. Restarts 
have a negative effect on Ibs. We conjecture that the reduction of large domains 
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Fig. 5. The Radiation-9 Objective over Time. 
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Fig. 6. Frequencies and Distribution of Activity Levels on Radiation-9. 

may not be such a good indicator of progress and may bias the search toward 
certain variables. 

Figure 6 provide interesting data about activities on radiation #9. In par- 
ticular, Figure 6 gives the frequencies of activity levels at the root, and plots 
the activity levels for all the variables. (Only the variables not fixed by single- 
ton arc-consistency are shown in the figures) . The two figures highlight that the 
probing process has isolated a small subset of the variables with very high ac- 
tivity levels, indicating that, on this benchmark, there are relatively few very 
active variables. It is tempting to conjecture that this benchmark has backdoors 
[14] or good cycle-cutsets [2] and that activity-based search was able to discover 
them, but more experiments are needed to confirm or disprove this conjecture. 

6 Conclusion 

Robust search procedures is a central component in the design of black-box 
constraint-programming solvers. This paper proposed activity-based search, the 



idea of using the activity of variables during propagation to guide the search. 
A variable activity is incrcincntcid every time the propagation step filters its 
domain and is aged otherwise. A sampling process is used to initialize the variable 
activities prior to search. Activity-based search was compared experimentally to 
the Ibs and wdeg heuristics on a variety of benchmarks. The experimental 
results have shown that Abs was significantly more robust than both IBS and 
WDEG on these classes of benchmarks and often produces significant performance 
improvements. 
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